Picture for Pinzhen Chen

Pinzhen Chen

When Flores Bloomz Wrong: Cross-Direction Contamination in Machine Translation Evaluation

Add code
Jan 28, 2026
Viaarxiv icon

Deep Learning Superresolution for 7T Knee MR Imaging: Impact on Image Quality and Diagnostic Performance

Add code
Jan 05, 2026
Viaarxiv icon

CHARM: Calibrating Reward Models With Chatbot Arena Scores

Add code
Apr 14, 2025
Viaarxiv icon

XL-Instruct: Synthetic Data for Cross-Lingual Open-Ended Generation

Add code
Mar 29, 2025
Viaarxiv icon

Generalizing From Short to Long: Effective Data Synthesis for Long-Context Instruction Tuning

Add code
Feb 21, 2025
Figure 1 for Generalizing From Short to Long: Effective Data Synthesis for Long-Context Instruction Tuning
Figure 2 for Generalizing From Short to Long: Effective Data Synthesis for Long-Context Instruction Tuning
Figure 3 for Generalizing From Short to Long: Effective Data Synthesis for Long-Context Instruction Tuning
Figure 4 for Generalizing From Short to Long: Effective Data Synthesis for Long-Context Instruction Tuning
Viaarxiv icon

Context and System Fusion in Post-ASR Emotion Recognition with Large Language Models

Add code
Oct 04, 2024
Viaarxiv icon

EMMA-500: Enhancing Massively Multilingual Adaptation of Large Language Models

Add code
Sep 26, 2024
Figure 1 for EMMA-500: Enhancing Massively Multilingual Adaptation of Large Language Models
Figure 2 for EMMA-500: Enhancing Massively Multilingual Adaptation of Large Language Models
Figure 3 for EMMA-500: Enhancing Massively Multilingual Adaptation of Large Language Models
Figure 4 for EMMA-500: Enhancing Massively Multilingual Adaptation of Large Language Models
Viaarxiv icon

Pitfalls and Outlooks in Using COMET

Add code
Sep 02, 2024
Figure 1 for Pitfalls and Outlooks in Using COMET
Figure 2 for Pitfalls and Outlooks in Using COMET
Figure 3 for Pitfalls and Outlooks in Using COMET
Figure 4 for Pitfalls and Outlooks in Using COMET
Viaarxiv icon

Quality or Quantity? On Data Scale and Diversity in Adapting Large Language Models for Low-Resource Translation

Add code
Aug 23, 2024
Figure 1 for Quality or Quantity? On Data Scale and Diversity in Adapting Large Language Models for Low-Resource Translation
Figure 2 for Quality or Quantity? On Data Scale and Diversity in Adapting Large Language Models for Low-Resource Translation
Figure 3 for Quality or Quantity? On Data Scale and Diversity in Adapting Large Language Models for Low-Resource Translation
Figure 4 for Quality or Quantity? On Data Scale and Diversity in Adapting Large Language Models for Low-Resource Translation
Viaarxiv icon

Is It Good Data for Multilingual Instruction Tuning or Just Bad Multilingual Evaluation for Large Language Models?

Add code
Jun 18, 2024
Figure 1 for Is It Good Data for Multilingual Instruction Tuning or Just Bad Multilingual Evaluation for Large Language Models?
Figure 2 for Is It Good Data for Multilingual Instruction Tuning or Just Bad Multilingual Evaluation for Large Language Models?
Figure 3 for Is It Good Data for Multilingual Instruction Tuning or Just Bad Multilingual Evaluation for Large Language Models?
Figure 4 for Is It Good Data for Multilingual Instruction Tuning or Just Bad Multilingual Evaluation for Large Language Models?
Viaarxiv icon